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Abstract 

We show that there exists a very natural, superstatistics-linked extension of the central limit theorem 
(CLT) to deformed exponentials (also called q-Gaussians): This generalization favorably compares with 
the one provided by S. Umarov and C. Tsallis [arXiv:cond-mat/0703533 ], since the latter requires a special 
"q-independence" condition on the data. On the contrary, our CLT proposal applies exactly in the usual 
conditions in which the classical CLT is used. Moreover, we show that, asymptotically, the q-independence 
condition is naturally induced by our version of the CLT. 
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I. INTRODUCTION 



The central limit theorems (CLT) can be ranked among the most important theorems in prob- 
ability theory and statistics and plays an essential role in several basic and applied disciplines, 
notably in statistical mechanics. Pioneers like A. de Moivre, PS. de Laplace, S.D. Poisson, and 
C.F. Gauss have shown that the Gaussian function is the attractor of independent additive contri- 
butions with a finite second variance. Distinguished authors like Chebyshev, Markov, Liapounov, 
Feller, Lindeberg and Levy have also made essential contributions to the CLT- theory. 

The random variables to which the classical CLT refers are required to be independent. Sub- 
sequent efforts along CLT lines have established corresponding theorems for weakly dependent 
random variables as well (see some pertinent references in Jlll^S])- However, the CLT does not 
hold if correlations between far-ranging random variables are not negligible (see 

Recent developments in statistical mechanics that have attracted the attention of many re- 
searches deal with strongly correlated random variables ([5] and references therein). These cor- 
relations do not rapidly decrease with any increasing distance between random variables and are 
often referred to as global correlations (see [6] for a definition). Is there an attractor that would 
replace the Gaussians in such a case? 

The answer is in the affirmative, as shown in , with the deformed or q-Gaussian playing 

the starring role. It is asserted in [2] that such a theorem cannot be obtained if we rely on classic 
algebra: it needs a construction based on a special algebra, which is called q-algebra 1150. The 
goal of this communication is to show that a q-generalization of the central limit theorem becomes 
indeed possible and in a very simple way without recourse to q-algebra. 



A. Systems that are q-distributed 

Consider a system S described by a random vector X with d— components whose covariance 
matrix reads 

K = (XX 1 ) = EXX\ (1) 

the superscript t indicating transposition. We say that X is q— Gaussian (or deformed Gaussian-) 
distributed if its probability distribution function writes as described by Eqs. CO)-© below. 
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in the case 1 < q < |±| 



i 



r 4t 

f x , q (X) = — (1 + x l hr l x) ^ , (2) 

r \-^t — 1 1 kAI 1 / 2 
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with matrix A being related to K in the fashion 

A = (m - 2) K. (3) 

J~l 

The number of degrees of freedom m is defined in terms of the dimension d of X as D7|] 

2 

m = d. (4) 

g - 1 



in the case q < 1 



r (2=2 + <A 

/** (X) = 2J (1 - ^E" 1 *)^ , (5) 



where the matrix £ is related to the covariance matrix via S = pfT. We introduce here a 
parameter p defined as 

V = 2^ + d, . (6) 
1 - q 

H. THE ROAD TOWARDS A NEW CLT 

As stated above, several attempts to generalize the central limit theorem (CLT) have been pub- 
lished recently , the aim being to have the Gaussian attractor replaced by the q-Gaussian 
attractor. We recall here the standard multivariate version of the CLT 

Theorem 1. Let Xi,X 2 , . . . be independent and identically distributed (i.i.d.) random vectors in 
M. d with expectation E [Xi] = and covariance matrix E [XiXf\ = K and let 



1 n 

^ = ^7$>- ( 7 ) 



n 

i=i 



Then W n converges weakly to a Gaussian vector W with covariance matrix K, or equivalently 



stated nidi 



i r* 1 r td x * K -h 



VteR d , lim Pr{W ra <t} = $i(t)= , 27rir|1/2 / •••/ e ~dX. (8) 



oo J — oo 
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The basic idea leading towards non-conventional CLTs is to find conditions under which con- 
vergence to the usual normal cumulative density function (cdf) $1 with covariance matrix K can 
be replaced by convergence to a q— Gaussian cdf 

%{t)= [ ■■■[" fx, q (x) dxt... dx d (9) 

J — oo J — oo 

with q > 1, /x o as defined in © and parameter m defined by © or, for q < 1, 



$ 9 (t) = / . . . / / x>g (x) dxi . . . cfe d (10) 

J —oo J —oo 

with as defined in © and parameter p defined by ©. We note that both cases m — > +oo and 
p — > +oo correspond to convergence q — > 1 to the Gaussian case. 

In two recent contributions, S. Umarov and C. Tsallis highlight the existence of such a central 
limit theorem, in the univariate [2] and multivariate [1] case, provided there exists a certain kind 
of dependence, called q— independence, between random vectors Xj. This q— independence con- 
dition is expressed in terms of the notions of q— Fourier transform F q and of q— product ® 9 
as 



F q [X 1 + X 2 ]=F q [X 1 ] ® q F q [X : 

which reduces to conventional independence for q — 1. 
We recall that the g— product of a; G C and y e C is 
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x ® g y = [x l - q + y x ~ q - l) 1 -« 



and the q— Fourier transform of a function / (x) , x E M. d , is 

F q [f] (0 = / {f 1 -" (x) + (1 - q) ix l i) ^ dx. 

However, this approach suffers from the lack of physical interpretation for such special depen- 
dence; moreover, the q— Fourier transform is a nonlinear transform (unless q = 1) what makes its 
use rather difficult. 

Another approach, as described in [8|], consists in keeping the independence assumption be- 
tween vectors Xi while replacing the n terms in © by a random number iV [n) of terms. That is, 
if the random variable N [n) follows a negative binomial distribution so as to diverge in a specified 
way, then convergence to a q— Gaussian distribution occurs whenever convergence occurs in the 
usual sense. 
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In the present contribution we show that there exists a much more natural way to extend the 

n o 

CLT, based on the Beck-Cohen notion of superstatistics D9|] (see the discussion in HOD). Our 
starting point is the same as that in Umarov's approach (i.e., assuming some kind of dependence 
between the summed terms). However, the manner in which we introduce this dependence among 
data is a natural one that can be interpreted in the physical framework of the Cohen-Beck physics 
(see [14] for an interesting overview). 

DDL PRESENT RESULTS 

Our present results can be conveniently condensed by stating two theorems, according to the 
value of parameter q. The essential idea is that of suitably introducing a chi-distributed random 
variable a that is independent (case q > 1) or dependent (case q < 1) of the data JQ, and then 
constructing the following scale mixture (typical of superstatistics JloTl ) 

1 " 

Z n = —Y^X l . (11) 

A. The case q > 1 

Theorem 2. If X\, X2, ■ ■ ■ are i.i.d. random vectors in M. d with zero mean and covariance ma- 
trix K, and if a denotes a random variable chi-distributed with m degrees of freedom, scale 
parameter (m — 2)~ 1 / 2 , and chosen independent of the X it then random vectors 



1 n 

■7=7 £** (12) 



v i=i 

converge weakly to a multivariate q— Gaussian vector Z with covariance matrix K. Equivalently 
stated: 

Vt G R d , lim Pr {Z n < t} = $ 9 (t) ; (13) 

n— >+oo 

with cdf§ q (t) defined as in ([9]). Moreover, 

m + d + 2 

q = — — > 1. (14) 

m + a 

Proof. First we note that the \~ density with m degrees of freedom and scale parameter ls 



Now, by the multivariate central limit theorem [T] above Ill7ll 

1 n 



i=i 



where A^ is a normal vector in M d with covariance matrix K. Applying from 111 ill its result [Th. 
2.8] we deduce that 

N 
a 

where — follows a q-Gaussian distribution with covariance matrix K and parameter q defined by 
©. □ 



B. The case q < 1 

The extension of theorem[2]to the case q < 1 proceeds as follows. 

Theorem 3. If X\, X%, ■ ■ ■ are i.i.d. random vectors in M. d with zero mean and covariance matrix 
K, and if a is a random variable independent of the X{ that is chi-distributed with m degrees of 



freedom and scale parameter \/m — 2, then the random vectors 

1 - 
Y n = -r-j= 2, Xi 

V 1 = 1 

with 



(15) 



\ 



(16) 



converge weakly to a multivariate q— Gaussian vector Y with covariance matrix K and distribu- 
tion function given by (|9]). Moreover, 

m — 4 



q 



< l. 



m — 2 

Proof. If Z has a characteristic distribution function (cdf) given by ©, then [8D 

Z 



(17) 



Y = <f>(Z) 



Vi + zW^z 

has cdf given by ([i~0)> . Since the function = M d — > {Y e M. d |F*A _1 F < 1} is continuous, 
the desired result is deduced by application of the continuous mapping theorem (see from ||l2|l its 
Theorem 2.3, p.7). □ 
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Remark 1. We note that Y n in 4771) is a normalized version of Z n in ( f72l) ; however, the fluctuation 
term a is replaced by a fluctuation term 




that involves the value of the sum itself - and thus is not independent of this sum anymore. Thus the 
case q < 1 can be considered as a fluctuating version of the usual CLT for which the fluctuation 
depends of the state of the system. Moreover, it is clear that as n increases, the distribution of the 
fluctuation b gets closer to a chi distribution with m + d degrees of freedom. 



C. Link with q— independence 

Although the extension of the CLT proposed above differs from the ones developed in a 
link can be established between both approaches for large values of n and for q > 1 as follows. 
Note that we assume q > 1 in the rest of the paper. 

Theorem 4. (linking theorem) Assume 1 < q < 1 + |. Consider n = n + ni together with the 
division of sum Z n in 4721) into two parts as 

1 / n ° n \ 

z » = 7Tk E*+ E * = z ™ + z « 2> ' <18) 

V \i=l i=n +l J 

Assume that the characteristic function (f) of Xi is such that L d \(p\ u dt < oo for some u > 1, and 
that data X{ are symmetric (X{ and —Xi have the same distribution). Then random vectors Z^ 

(2) 

and Zn are asymptotically q— independent in the sense that 

Ve > 0, 3N such that n > N,m> N ^ WF^Z^ + Z n 2) ] - F q [Z^\ ® qi F q [Z n 2) ] \ \ x < e 
with q x = z(q) = ygjg . 



For didactic reasons we postpone the proof of this result until next Section. We deduce from 
it that, asymptotically, the CLT theorem © exactly generates the ^-independence condition re- 
quired for application of the particular CLT version proposed in 



he q—h 
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IV. PROOF OF THE LINKING THEOREM 



A. Introduction 

In order to simplify the proof we will assume that vectors Xj verify a stronger version of the 
CLT than the one stated in theorem [T} namely the CLT in total variation. Now, the total variation 
divergence between two probability densities / and g is 

<W (/,*) = I [ \f-g\- (19) 



2 „ 

If U and V are random vectors distributed according to / and g respectively, we will denote 

d TV (U,V) = d TV (f,g). 



The total variation version of the CLT writes as follows (see Ill2h Th. 2.31.) 



Theorem 5. ( CLT in total variation) Assume that Xi, X 2 , . . . are i.i.d random vectors ofM> d with 
zero expectation, covariance matrix K and characteristic function such that J \ <p\ u dt < oo for 
some v > 1. If W n = Y^l=i an d W is a normal vector in M. d with covariance matrix K 
then 

lim d TV (W n , W) = 0. (20) 

n— >+oo 



Let us introduce the following notations: Z n denotes a version of sum (1121) where all X, are 
replaced by i.i.d. Gaussian vectors iVj E M. d with covariance matrix K, i.e., 



1 / no n \ 

V \j=i i=n +l / 



The proof of theorem[4]is based on the fact that vectors Zn and Z^f 1 are exactly q— independent (as 
seen in subsection llV Bl below). Since n is large, according to the above total variations theorem[5l 
Zn^ and Zn^ are close to their q— Gaussian counterparts Z^P and zj 2 , respectively (see Lemma 
IV B below). It remains to check that closeness between these vectors can be stated in terms of 
their q— transforms. We proceed in five steps, that invoke technical lemmas that are the subject of 
Subsection C below. These steps are: 

• step 1: components Zn^ and Z^ ase, exactly q— independent, as is proved in Thm. [6] of 
sub section |IVB] below. 
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• step 2: let us fix e > 0, and write 

\\F q [Z^ + Z^\ - F q [Z$] ® ft FJZ^JIU 

< ||-Fg[Zn ^ + Zn ] — F q [Z n ^ + Z« ]||oo 

+ ® gi i^ 2) ] - F q [zP] ® 9l Fjzi 2) ]|U 

• step 3: the first term \\F q [zjP + zj?] - F q [z£ ] + Z®}^ = \\F q [Z n ] - F^Z^ can be 
bounded as follows 

H-Fj-Zn] — i^fZJUoo < 2d T v(Z n , Z n ) < 2d TV (X n , X n ) 

where the first inequality follows from Lemma |3] and the second one from Lemma [TJ below. 
Thus a value jVj can be chosen so that n > Ni and ri\ > Ni ensure that this term is smaller 
than |. 

• step 4: the second term \\F q [Z^} ® qi F q [z£ ] ] - F q [Z^} ® 9l Fj^ 2) ]||oo can be bounded 
by applying Lemma 0] for a large enough value of n = n + n\, say n > N 2 , we have 

ll^t^] ®« ^[^ 2) ] - ^[^] ®* ^[^ 2) ] Hoc < 2rf T v(^ 1) , Z«) + 2d TV (Z^\ Zg>) 

Finally, from the total variation CLT, there exists a value N 3 such that n > N 3 and n\ > N 3 
implies that each of both total variation divergences is smaller than |. 

• step 5: The consideration of iV = max(iVi , N 2 , N 3 ) is then seen to prove the linking theorem 

m 

We turn now our attention to those results that we have used in this proof. 



B. Components of q— Gaussian vectors are q— independent 

We first begin to check that "sub-vectors" extracted from q— Gaussian vectors are exactly 
q— independent; this results is obvious from the fact that, by the CLT given in yj] (Thm. 4.1), 
these sub-vectors can be considered as limit cases of sequences of q— independent sequences. 
However, the mathematical verification of this property is of an instructive nature and we proceed 
to give it. For readability, we will say that X ~ (q, d) if X is a q— Gaussian vector of dimension d 
and nonextensivity parameter q. 
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Theorem 6. If 1 < qo < 1 + ^ and vector X = [X[, X^f ~ (go, 2d) with parameter g > 1 then 
vectors X x ~ (g, d) and X 2 ~ (g, d) and dzey are q— independent: 

F q [X x + X 2 ] = F q [Xi] ® w F g [X 2 ] (21) 

Proof. Since Xi ~ (g, d), we know from the Corollary 2.3 of [1] that F q [Xi] ~ (gi, d). Moreover, 
since X x and X 2 are components of the same g— Gaussian vector, from [8] we deduce that X x + 
X 2 ~ (g, d) so that F q [X x + X 2 ] ~ (gi, d). Finally, it is easy to check that since F q [X{\ ~ (g 1; d) 
and F q [X 2 ] ~ (gi, d) then F g [Xi] ® 91 F q [X 2 ] ~ (gi, d). The fact that both terms have same 
covariance matrices is straightforward, what proves the result. □ 



We note that q— correlation (|2Tj) corresponds to q— independence of the third kind as listed in 
Table 1 of yj]. We pass now to the consideration of the four Lemmas invoked in the proof of the 



linking theorem. 



C. Technical lemmas 

As we are concerned with scale mixtures of Gaussian vectors, we need the following lemma. 

Lemma l.IfU and V are random vectors in IR d and a is a random variable independent of U 
and V then 

drv (-,-) <d TV (U,V). (22) 



a a 

Proof. The distributions of scale mixtures U/a and V/a write, in terms of the distributions of U 
and of V, in the fashion 

fu/a {x) = I -if a {a) fu (-) da, g v/a (x) = [ \f a (a) f v (-) da. (23) 
Jm.+ a v a/ J R + a \a/ 
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It thus follows that 



fU V\ If 

dTv[—,— \ = o / \fu/a (X) ~ fv/a (x) \dx 

\a a J 2 J Rd 

= \ L 1 L ^ fa (a) ( fu © ■ fv ©) Mdx 
- \ LL^ fa(a)ifi r © i_ ' h '0 idadx 

= o / fa (a) da I \fu(z)- f v {z)\a d dz 
= \( fa (a) da [ \fu(z)-f v (z)\dz 
= \ I \fu-fv\=d TV (U,V). 

We also needed above the following 
Lemma 2. For q > 1 and 3? (z) > 0, the function 



i 



where 



□ 



z's a Lip schitz function with unit constant: 

\*Pq,z (Xl) ~ ^q,z (X ) \<\Xi- X \, (24) 

Proof We have 



\lpq,z {Xl) ~ lpq, z (X ) \ < SUp \lf/ q>z (x) | |xi - X \ , (25) 

XQ<X<X1 



(l + zx?- 1 )^ 1 



with > 0, so that, since x > and 3? (z) > 0, 

W q , z (x) | = l — -33 < 1. (27) 

□ 

Two straightforward consequences of such inequality are the following lemmas, that we have 
also used above. 
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Lemma 3. For any random vectors U and V,ifq>l, the following inequality holds 



\F q [U}-F q [V] \\ 00 <2d TV (U,V) 



(28) 



Proof. This result is a straightforward consequence of inequality (34) of reference yj]. However, 
an elementary proof writes as follows: denote fu and fv the respective probability densities of U 
and V. Then, V£ G M. d , 

\F q [U}(0-F q [V](0\ 
< /r- I {flT 9 (*) + (1 - ?) »V0^ - {fv" (*) +(l-q) ix 1 ^ \dx 

As 9ft ((1 — g) ix^) = and > 0, by lemma[2l the integrand is bounded by \f v (x) — f v (x) |; 
since this holds V£ e M. d , the desired result follows. □ 

We remark here that inequality (|28l) is a simple generalization of the well-known q — 1 case, 
in which F 9= i corresponds to the classical Fourier transform. Thus a well-known result of the 
Fourier theory is reproduced, namely 

[E7] - F 1 [V] \\oo<2d TV (U,V). 



As another consequence of lemma [2] we have 
Lemma 4. For notational simplicity, let us denote as Z x 

~ (2) 

Z 2 = Z n those random vectors defined in part IV.A. Then, for n large enough, 



7 (i) 7 

An , Z/ 2 



7 (2) y 



Zn and 



\F q [Z x \{£) ® qi F q [Z 2 \{S)-F q [Z x \(Z) ® qi FjZ 2 ](0||oo < 2d TV (Z 1 ,Z 1 ) + 2d TV (Z 2 ,Z 2 ). 



Proof. For any £ e 



< |F ff [ZJ(e) ® ?1 F ff [Z a ](0 - F q [Z x m ® qi F q [Z 2 ](0 
® 91 F s [Z 2 ](0 - ^[^iKO ®* F q [Z 2 ](0 

+ lV , g 1 ,F^a[^](O-i( F 9[ Z2 ]^))-^gi,F^*[^](O-i( F J^2](0) 

Since Z 2 is g— Gaussian, and since 1 < g < 1 + |, there exists an a 2 > (as given in 
equation (15) of reference yj]) such that F q l ~ qi \Z 2 ]{i) — 1 = a 2 (gi — 1)£ 2 so that, since gi > 
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0, it follows that F q ~ qi [Z 2 \ (£) > 1. From the CLT in total variation, we can choose n large 
enough so that d TV (F q [Z 2 ], F q [Z 2 }) is arbitrarily small, which in turns implies, by Lemma[3l that 
|-F g [Z 2 ](£) — F ? [Z 2 ](£)I 1S arbitrarily small as well. By continuity of the function x t— > x^ qi — 1, 
and since F q [Z 2 ] is real- valued by the symmetry of the data, this ensures that F q ~ qi [Z 2 ] (£) — 1 > 0. 
Thus, the first term can be bounded using lemma[2]in the fashion 

\^ qi ,Ft qi [Z 2 U)-l{ F <l\. Z ^)) ~ ^ qi ,Ft qi [Z 2 m-l (^i](0)i<i^[^i](0-W](e)i- 

Accordingly, since Z\ is q— Gaussian, there exists a x > such that F q l ~ qi [Zi](^) — 1 = ot\{q\ — 
1)£ 2 , hence F q ~ qi [Zi](£) > 1. Recourse again to lemma [2] yields 

(F q [Z 2 ](0)\ < \F q [Z 2 ](0 - F q [Z 2 ](0\. 
Applying now lemma [3] to each of both terms above yields 

\F q [Zi]{£) ® gi F q [Z 2 }(0 - F q [Z x ]{i) ® qi F g [Z 2 m\ < 2d TV (Z 1 , Z x ) + 2d TV (Z 2 , Z 2 ). 
As this holds for any value of £ G C, the result follows. □ 

V. CONCLUSIONS 

We have here dealt with non-conventional central limit theorems, whose attractor is a deformed 
or q-Gaussian. Based on the Beck-Cohen notion of superstatistics [9], with scale mixtures relating 
random variables a la Eq. (fTTT) . it has been shown that there exists a very natural extension of the 
central limit theorem to these deformed exponentials that quite favorably compares with the one 
provided by S. Umarov and C. Tsallis [arXiv:cond-mat/0703533J. This is so because the latter re- 
quires a special "q-independence condition on the data". On the contrary, our CLT proposal applies 
exactly in the usual conditions in which the classical CLT is used. However, links between ours 
and the Umarov-Tsallis treatment have also been established, which makes the here reported CLT 
a hopefully convenient tool for understanding the intricacies of the physical processes described 
by power-laws probability distributions, as exemplified, for instance, by the examples reported in 
|5|] (and references therein). 
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